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Abstract. We introduce the neural network approach to global fits of parton distribution functions. 
First we review previous work on unbiased parametrizations of deep-inelastic structure functions 
with faithful estimation of their uncertainties, and then we summarize the current status of neural 
network parton distribution fits. 

1. INTRODUCTION 

The requirements of precision physics at hadron colliders have recently led to a rapid 
improvement in the techniques for the determination of parton distributions of the 
nucleon [ 1 ] . Specifically it is now mandatory to determine accurately the uncertainty 
on these quantities. The main difficulty is that one is trying to determine the uncertainty 
on a function, that is, a probability measure in a space of functions, and to extract it from 
a finite set of experimental data, a problem which is mathematically ill-posed 

The shortcomings of the standard approach to global parton fits are well known: the 
bias introduced by choosing fixed functional forms to parametrize the parton distribu- 
tions (also known as model dependence), the problems to assess faithfully their uncer- 
tainties, the combination of inconsistent experiments, and the lack of general, process- 
independent error propagation techniques. Although the problem of quantifying the un- 
certainties in pdfs has seen a huge progress since its paramount importance was raised 
some years ago, until now no unambiguous conclusions have been obtained. 

Here we present a novel strategy to address the problem of constructing unbiased 
parametrizations of parton distributions with a faithful estimation of their uncertainties, 
based on a combination of two techniques: Monte Carlo methods and neural networks. 
First we review recent work on the related problem of the construction of bias-free 
parametrizations of structure functions from experimental data, and then we turn to the 
application of our strategy to parton distributions. 



1 Talk given at Deep Inelastic Scattering 2005 Workshop (Madison) by J. R., on behalf of the NNPDF 
Collaboration. 



2. STRUCTURE FUNCTIONS 



The strategy presented in |0, 0] to address to problem of parametrizing deep-inelastic 
structure functions F(x,Q 2 ) is a combination of two techniques: first we construct a 
Monte Carlo sampling of the experimental data (generating artificial data replicas), and 
then we train neural networks in each data replica, to construct a probability measure 
in the space of structure functions [F(x, Q 2 )]. The probability measure constructed 
in this way contains all information from experimental data, including correlations, 
with the only assumption of smoothness. Expectation values and moments over this 
probability measure are then evaluated as averages over the trained network sample, 
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where & [F] is an arbitrary function of F(x, Q 2 ). 

The first step is the Monte Carlo sampling of experimental data, generating N iep 
replicas of the original N^t experimental data, 
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where r are gaussian random numbers with the same correlation as the respective 
uncertainties, and <J stat ,<7 sys ,On are the statistical, systematic and normalization errors. 
The number of replicas _/V re p has to be large enough so that the replica sample reproduces 
central values, errors and correlations of the experimental data. 

The second step consists on training a neural network on each of the data replicas. A 
neural network [4] (see fig. [B is a highly nonlinear mapping between input and output 

patterns as a function of its parameters (the so-called weights OO^and thresholds (y ). 
Neural networks are specially suitable to parametrize parton distributions since they 
are unbiased, robust approximants and interpolate between data points with the only 
assumption of smoothness. The neural network training consist on the minimization for 
each replica of the x 2 defined with the inverse of the experimental covariance matrix, 

2(*) = 1 y' fp(«t)(k) _ F (net)(*)\ cov -l / f (art)(l) _ F (nct)(k)\ _ (3) 
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Our minimization strategy is based on Genetic Algorithms |5], which are specially suited 
for finding global minima in highly nonlinear minimization problems. 

The set of trained nets, once is validated through suitable statistical estimators, be- 
comes the sought-for probability measure [F(x, Q 2 )~\ in the space of structure func- 
tions. Now observables with errors and correlations can be computed from averages over 
this probability measure, using eq. 0. For example, the average and error of a structure 



FIGURE 1. Left: a neural network. Right: Summary of the NNPDF approach. 

function F(x, Q 2 ) at arbitrary (jc, Q 2 ) can be computed as 

(F(x,Q 2 )) = £i> et )«(*,2 2 ), a(x,Q 2 ) = ^(F(x,Q 2 ) 2 } - (F(x,Q 2 )} 2 . (4) 
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Our strategy is summarized in fig. UJ In fig. |2] we show our results 2 for the proton 
structure function F 2 p (x, Q 2 ), both in our original fit without HERA data ILJ and in the 
latest fit [3] including HERA data. 



3. PARTON DISTRIBUTIONS 

The strategy presented in the above section can be used to parametrize parton distribu- 
tions, provided one now takes into account Altarelli-Parisi QCD evolution. Therefore 
we need to define a suitable evolution formalism, and we will consider for the sake of 
simplicity nonsinglet parton evolution. Since complex neural networks are not allowed, 
we must use the convolution theorem to evolve parton distributions in x— space using the 
inverse T(x) of the Mellin space evolution factor T(N), defined as 

q(N 7 Q 2 )=q(N,Q 2 )T(N 7 a s (Q 2 ),a s (Q 2 )) , (5) 
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The only subtlety is that eq. © defines a distribution, which must therefore be regulated 
at x = 1 , yielding the final evolution equation, 

q{x,Q 2 )=q{x,Ql) j\yT{y)+ j l ^T(y) (?(pGo) G§)) , (V) 



The source code, driver program and graphical web interface for our structure function fits is available 
at http://sophia.ecm.ub.es/f2neural 




FIGURE 2. Left: The proton structure function, both the old version 1 2 ] without HERA data and the new 
version |3] which included HERA data. Right: Preliminary results on the nonsinglet parton distribution 
1ns( x , Qq) compared with other global fits, for Qq = 2 GeV 2 



where in the above equation q(x, Qq) is parametrized using a neural network. At higher 
orders in perturbation theory coefficient functions C(N) are introduced through a modi- 
fied evolution factor, T(N) = T(N)C(N) . We have benchmarked our evolution code with 
the Les Houches benchmark tables |g]. The evolution factor F(x) and its integral are 
computed and interpolated before the neural network training in order to have a faster 
fitting procedure. 

As a first application of our method, we extract the nonsinglet parton distribution 
qNs{*i Qq) = (u + u — d — d)(x, Qq) from the nonsinglet structure function F^ix, Q 2 ) 
measured by the NMC 0] andBCDMS HH 

collaborations. The preliminary results of 
a NLO fit with fully correlated uncertainties can be seen in fig.EJ Our result is consistent 
within the error bands with the results from other global fits II id. I 111 1 1211 in almost all the 
range of Bjorken-x It is clear that the large uncertainties at small x do not allow, within 
the current experimental data, to determine if qNs( x , Q 2 ) grows at small x, as supported 
by different theoretical arguments as well as by other global parton fits. Only additional 
nonsinglet structure function data at small x can settle this issue. 

Summarizing, we have described a general technique to parametrize experimental 
data in an bias-free way with a faithful estimation of their uncertainties, which has been 
successfully applied to structure functions and that now is being implemented in the 
context of global parton distribution fits. 
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